Adjacency and Proximity Searching in the Science Citation Index and Google

نویسندگان

  • Ronald N. Kostoff
  • John T. Rigsby
  • Ryan B. Barth
چکیده

We have developed simple algorithms that allow adjacency and proximity searching in Google and the Science Citation Index (SCI). The SCI algorithm exploits the fact that SCI stopwords in a search phrase function as a placeholder. Such a phrase serves effectively as a fixed adjacency condition determined by the number n of adjacent stopwords (i.e., retrieve all records where word A and word B are separated by n words in at least one location). The algorithm integrates over search phrases with different numbers of adjacent stopwords to provide a flexible adjacency or proximity capability (i.e., retrieve all records where word A and word B are separated by n or less words in at least one location, where n is the maximum separation desired between A and B in at least one location). The Google algorithm exploits the fact that asterisks (in Google) separating words in a phrase function like word wildcards. The difference between two such phrases (the first phrase containing one less asterisk than the second phrase) serves effectively as a fixed adjacency or proximity condition, with the number of separating words equal to the number of asterisks in the first phrase. The algorithm integrates over these phrase differentials to provide a flexible adjacency or proximity capability (i.e., retrieve all records where word A and word B are separated by n or less words in at least one location, where n is the maximum separation desired between A and B in at least one location). Ronald N. Kostoff, John T. Rigsby, and Ryan B. Barth 2 Journal of Information Science © CILIP 2005

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Brief Communication Adjacency and proximity searching in the Science Citation Index and Google

We have developed simple algorithms that allow adjacency and proximity searching in Google and the Science Citation Index (SCI). The SCI algorithm exploits the fact that SCI stopwords in a search phrase function as a placeholder. Such a phrase serves effectively as a fixed adjacency condition determined by the number n of adjacent stopwords (i.e., retrieve all records where word A and word B ar...

متن کامل

Investigating the Effect of Spatial Proximity on Iran University- Industry Co-publications by using Gravity Model

Background and Aim: Due to the importance of scientific relations between university and industry, it is so important to identify the factors that affect these relations. So,the aim of this study is to investigate the effect of spatial proximity on university- industry collaboration. The collaboration indicator which is used here is University- Industry Co-publications. Methods: The research is...

متن کامل

Citation Analysis of Iranian Journal of Basic Medical Sciences in ISI Web of Knowledge, Scopus, and Google Scholar

    Objective(s): Citation tracking is an important method to analyze the scientific impact of journal articles and can be done through Scopus (SC), Google Scholar (GS), or ISI web of knowledge (WOS). In the current study, we analyzed the citations to 2011-2012 articles of Iranian Journal of Basic Medical Sciences (IJBMS) in these three resources.   Material and Methods: The rel...

متن کامل

Does it Matter Which Citation Tool is Used to Compare the h-index of a Group of Highly Cited Researchers?

h-index retrieved by citation indexes (Scopus, Google scholar, and Web of Science) is used to measure the scientific performance and the research impact studies based on the number of publications and citations of a scientist. It also is easily available and may be used for performance measures of scientists, and for recruitment decisions. The aim of this study is to investigate the difference ...

متن کامل

SAVVY SEARCHING Deflated, inflated and phantom citation counts

Purpose – The purpose of this paper is to clarify some issues regarding citation indexing, analysis and searching. Design/methodology/approach – The paper begins with a discussion on an article in the D-Lib Magazine and then focuses on deflated citation counts and inflated and phantom citation counts. Findings – The combination of the inflated citation count values dispensed by Google Scholar (...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006